A Method of Adding New Entries to a Valency Dictionary by Exploiting Existing Lexical Resources

نویسندگان

  • Sanae Fujita
  • Francis Bond
چکیده

Information on subcategorization and selectional restrictions in a valency dictionary is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization. However, adding this detailed information is both time consuming and costly. In this paper we present a method of assigning valency information and selectional restrictions to entries in a bilingual dictionary, based on information in an existing valency dictionary. The method is based on two basic assumptions: words with similar meaning have similar subcategorization frames and selectional restrictions; and words with the same translations have similar meanings. Based on these assumptions, new valency entries are constructed for words in a plain bilingual dictionary, using entries with similar Japanese meaning and the same English translations. The measurement of similarity in Japanese is done using paraphrased examples, so that non-expert native speakers can carry out the task. An initial evaluation of 171 new patterns showed that adding them to a Japaneseto-English machine translation system improved the translation for 31% of sentences using these verbs, and degraded it for 8%, a clear improvement in quality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of a Method of Creating New Valency Entries

Information on subcategorization and selectional restrictions is important for natural language processing tasks such as deep parsing, rule-based machine translation and automatic summarization. In this paper we present a method of adding detailed entries to a bilingual dictionary, based on information in an existing valency dictionary. The method is based on two assumptions: words with similar...

متن کامل

Extending The Coverage Of A Valency Dictionary

Information on subcategorization and selectional restrictions is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization. However, adding this detailed information to a valency dictionary is both time consuming and costly. In this paper we present a method of assigning valency information and select...

متن کامل

Automatic Addition of Verbal Semantic Attributes to a Japanese-to-English Valency Transfer Dictionary

The effectiveness of using the semantic attributes of verbs has been shown in various kinds of natural processing systems, such as machine translation systems. The addition of an attribute value is, however, time-consuming and must be performed by hand by an expert on the attribute value. In this paper, two methods for efficiently adding verbal semantic attributes to a Japanese-to-English valen...

متن کامل

An Automatic Method of Creating Valency Entries using Plain Bilingual Dictionaries

In this paper we introduce a fully automatic method to extend an existing rich bilingual valency dictionary by using information from multiple plain bilingual dictionaries. We evaluate our method using a translation regression test, and get an improvement of 7%.

متن کامل

Automatic Valency Derivation for Related Languages

This paper describes an experiment combining several existing data resources (parallel corpora, valency lexicon, morphological taggers, bilingual dictionary etc.) and exploiting them in a task of building a valency lexicon for a related language (Russian) derived from a high quality manually created valency lexicon for Czech (Vallex) containing several thousands of verbs with very rich syntacti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002